💾 Cache-Oblivious Algorithms - abnv · Scour

Iteratively optimizing an SPSC queue 🎯Ring Buffers

blog.c21-mac.com·3d·r/cpp·

Donald Raab: Measuring the Startup Memory Cost for Lazy Iteration Patterns in Java 🗑️Garbage Collection

donraab.medium.com·2d·

Stack vs malloc: real-world benchmark shows 2–6x difference 📚Stack Data Structures

blog.stackademic.com

·23h·DEV·

Metal Quantized Attention: pulling M5 Max ahead with Int8 matrix multiplication 🗺️Region Inference

releases.drawthings.ai·18h·Hacker News·

Finding performance bottlenecks with Pyroscope and Alloy: An example using TON blockchain 🔗Hash Algorithms

grafana.com·2d·

Why I’m Building a Database Engine in C# 🗃️Query Compilation

nockawa.github.io·5d·Hacker News·

facebookincubator/dispenso: The project provides high-performance concurrency, enabling highly parallel computation. ⏱️Async Runtimes

github.com·7h·Hacker News·

Accelerate CPU-based AI inference workloads using Intel AMX on Amazon EC2 🗺️Region Inference

aws.amazon.com·2d·

'Performance without compromise': AMD debuts first dual 3D V-Cache Ryzen CPU in potential showdown against Threadripper and EPYC siblings 🎯CPU Dispatch

techradar.com

·1d·

Taming the JVM Latency Monster 📊Memory Profilers

dzone.com·6d·

Speculative Decoding: Performance or Illusion? 🗺️Region Inference

specdecode-bench.github.io·5d·Hacker News·

Stop obsessing over your GPU's core clock — memory clock matters more for local LLM inference 🗺️Region Inference

xda-developers.com·4d·

Shaun Thomas: PG Phriday: Absorbing the Load 🗑️Concurrent GC

pgedge.com·6d·

Systematic Analysis of CPU-Induced Slowdowns in Multi-GPU LLM Inference (Georgia Tech) 🗺️Region Inference

semiengineering.com·5d·

Finding-Fortune/Binary-Cellular-Automata: The Cellular Automata algorithm for cave generation computed with binary operations for a massive performance speed-up. >10x faster than other noise libraries at cave generation. ⚡Cache-Aware Algorithms

github.com·1d·r/proceduralgeneration·

Discord Engineers Add Distributed Tracing to Elixir's Actor Model Without Performance Penalty ✨Gleam

infoq.com·5d·

From 21 Seconds to 759ms: Engineering a High-Performance Log Aggregation System 📮Persistent Queues

medium.com·3d·

Postgres Performance: Why Peak Throughput Benchmarks Miss the Real Problem 🗄️Database Engines

tigerdata.com·5d·

The Synthesis Problem: Why I’m Building a New Logic Toolchain 🔬Nanopasses

llama.gs·6d·Hacker News·

Intel Ultra 5 250K Plus CPU Review & Benchmarks: Gaming, Production, & Power Consumption 🔮Speculative Execution

gamersnexus.net·6d·

Loading more...